Dynamic Document Delivery: Generating Natural Language Texts on Demand

نویسندگان

  • Robert Dale
  • Stephen J. Green
  • Maria Milosavljevic
  • Cécile Paris
  • Karin M. Verspoor
  • Sandra Williams
چکیده

Research in natural language generation promises significant advances in the ways in which we can make available the contents of underlying information sources. Most work in the field relies on the existence of carefully constructed artificial intelligence knowledge bases; however, the reality is that most information currently stored on computers is not represented in this format. In this paper, we describe some work in progress where we attempt to generate large numbers of texts automatically from existing underlying databases. We focus here in particular on the automatic generation of descriptions of objects stored in a museum database, highlighting the difficulties that arise in using a real data source, and pointing to some possible solutions.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Plagiarism checker for Persian (PCP) texts using hash-based tree representative fingerprinting

With due respect to the authors’ rights, plagiarism detection, is one of the critical problems in the field of text-mining that many researchers are interested in. This issue is considered as a serious one in high academic institutions. There exist language-free tools which do not yield any reliable results since the special features of every language are ignored in them. Considering the paucit...

متن کامل

Adding Syntax to Dynamic Programming for Aligning Comparable Texts for the Generation of Paraphrases

Multiple sequence alignment techniques have recently gained popularity in the Natural Language community, especially for tasks such as machine translation, text generation, and paraphrase identification. Prior work falls into two categories, depending on the type of input used: (a) parallel corpora (e.g., multiple translations of the same text) or (b) comparable texts (non-parallel but on the s...

متن کامل

Evaluating integrated NLP in foreign language learning: technology meets pedagogy

In this paper I would like to present the pedagogical perspective upon a 3-year research project with direct implications for the field of Foreign Language Teaching (FLT), and English for Professional Purposes (ESP) in particular. As a project in Computer Science, the basic aim of LARFLAST 1 was to investigate the integration of several innovative software components into a harmonized environme...

متن کامل

Discourse Factors in Multi-Document Summarization

The over-abundance of information today, especially online, has established the need for natural language technologies that can help the user find relevant information; multidocument summarization (MDS) and question answering (QA) are two examples. The requirement in MDS and openended QA to produce multi-sentential answers imposes the extra demand that the output of such systems be a coherent d...

متن کامل

Automatic Acquisition of Parallel Corpora from Websites with Dynamic Content

Parallel corpora are indispensable resources for a variety of multilingual natural language processing tasks. This paper presents a technique for fully automatic construction of constantly growing parallel corpora. We propose a simple and effective dictionary-based algorithm to extract parallel document pairs from a large collection of articles retrieved from the Internet, potentially containin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998